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Abstract 

Mannose- or mannan-binding lectin (MBL) is 3 member of the collectin protein family, which includes lung 
jsui factanl proteins SP-A and SP-D. Each member consists of similar or identical polypeptide chains with a 
jregion of collagen-like sequence followed by a Otype lectin domain. The polypeptides associate in threes to 
form a subunlt containing a collagen-like helix, with three clustered lectin domains. These subunits associate 
Into larger structures, usually with 12-18 polypeptides. The collectins bind to patterns of neutral sugars on 
surfaces (e.g. of micro-organisms) and mediate effector functions associated with kllHng/phagocytosis. MBL 
is the only collectin which activates complement it resembles In quaternary structure the complement 
protein Clq, which recognizes targets via charge clusters. Binding of MBl to a surface activates MBl- 
associated serine proteases (MASPs) attached to M8L, and MASP-2 activates complement proteins C4 and 
C2. The masps are homologous to the Clq-associated proteases, Clr and C1s. MBL therefore activates 
complement by a mechanism very similar to Ciq, and engages the opsonic activity of complement to clear 
micro-organisms. The serum concentration of MBL is very variable in humans. The variability is largely 
associated with mutations leading to amino acid substitutions in the collagen-like region which decrease 
MBl assembly and stability. Many studies demonstrate that MBL deficiency is associated with susceptibility 
o a range of Infectious and inflammatory diseases. 
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an-t inding lectin (MBL) is a plasma glycoprotein of the 
tinf nnily [1] 7 members of which bind to carbohydrates 
e sux faces of micro-organisms and particulate materials 
rding altered host material, such as apoptotic cells), via 
C-cchninal lectin domains. It is also known as mannan- 
ng protein (MB?) or rnannose-binding lectin, since k 
3 to yeast mannao [2] as well as to mannosc coupled to 
arose [3]. Previous names lor MBL include core-specific 
(C5|L), from its interaction with the core motif of 
nked pligosaccharide [4]> and Ra reactive factor (RaRI-"), 
bind \ to Ra chemotype strains of SalmonvUa [5J. The 
MB L is now used in preference to MBP to avoid 
with identically abbreviated proteins (e.g. major 
iin and myelin basic protein) and to emphasize its 



h character. 



ly peptide chain of secreted MBL is 228 amino 
lond (not including the 20-residue signal peptide), and 
cf a 20-residue 'cysteine-rich' region (containing 
teirus), followed by a collagenous region containing 
Gly-Xaa! -Xaa^ triplets, a c neck' region and then a C- 
nal cLlcium-dependent carbohydrate-binding lectin do- 
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called a carbohydrate-recognition domain (CRD) 
e neck region forms an a-helical coiled-coil scruc- 



whic i probably promotes crlmerizauon of three poly- 
O form the subunit. The t rimer is stabilised by 



tsad; CRD, cnrbohydmie-reiCQgnlilon domain, MBL, irnnnnn-frnriirK} lecUr.; WMP, 
To whom corrfbpondence 5h&ui<J be addressed (e-msll bo&.5lm©t>i<xh-QH-2euk). 



hydrophobic interactions and inter-chain disulphide bonds 
within the N- terminal cysteine- rich region. MBL subunits as- 
semble into larger oligomeric structures forming a 'bunch-of- 
tulips' or sertiform appearance (Figure 1) [10). The common- 
est oligomeric form in humans appears to bea six-subunic (t 8- 
polypeptide chain) form with an overall molecular mas3 of 
about 18 x 25 000 Da. Unusually, disulphide bridging within 
and between sub units is incomplete and variable, so that, 
for example* the common six-subumt form is heterogeneous., 
consisting of a number of isoforms with different disulphide 
bridging, This is evident from SDS/PAGE analysis of 
non-reduced MBL, and comparison with analyses made 
by non-denaturing; hydrodynarnie methods [11,12]. Several 
publications (e.g. (13 7 14J) suggest that native MBL in serum or 
plasma occurs in oligomeric forms of different sizes (ranging 
from one to six subunits in humans),, but it appears likely 
that the six-subumt oligomer is by far the major form [15]; 
smaller oligomers may form on storage or processing of 
plasma. Other proteins with similar quaternary structure 
also show variable polymerization: Clq, for cxamplc T has 
a six-head and a minor two- head form; the collectin SP-A 
(surfactant protein- A) also occurs in forms with one to 
six subunitK [16]. The oligomerizaiion of MBL allows avid 
binding to carbohydrate ligands due to the multiple CRDs 
present; forms with a lower degree of polymerization bind 
less avidly to carbohydrate surfaces, and are also defective in 
activating complement [12]. 

MBL has no N-linked carbohydrate, but its collagenous 
region contains hydroxylaced lysine and proline residues, 
which have O-linked glycosyiation. 
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Figure i j Strutture 4nd organization of MBL 

(a) MBL c insists of on N-lerrni'nal cross-linking region, a collagen-like region, a neck region and a CRD. These 25-kOa 
pglypepiidps inmerize to form a structural subunit (b) ihai i5 found in serum as oligomers (c) [26], 
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calcium Ions for binding to its carbohydrate 
emm ion directly participates in ligand binding, 
rdin? tion bonds Co the 3- and 4-OH groups 
suga \ Wei? et al. [17] demonstrated this and 
f|>r the 3~ and 4-OH groups to be in the 
the sugar ring structure, This requirement 
bonding to amino-acid side chains and 
of MBL to TV -acetyl-D- glucosamine, 
mannosaminc, L-fucosc and glucose [10], 
ar structures on microbial surfaces, not 
mammalian surfaces, can bind with high 
s which T within the trimeric head of each 
4f>~55 A apart , The specificity of MBL in 
patterns on surfaces (pathogen-associated 
or PAMPs) relies therefore on the identity 
idc, its exposure on the surface and 
ugar residues- The spacing between the in- 
bin one subunit, and the spacing of subunit 
ch other, are the determining factors (for 
see [IS]). 

have shown that MBL binds to clinically 
sms including HIV-1 [19], influenza 
} coccus aureus [20], Neisseria meningitidis 
albicans [22] and Leishmanial major [23]. 



MBL serujn levels 

MBL scrum lcve,s arc extremely variable, ranging from 
almost 0 to >5 \x.\ /ml in healthy humans. MBL deficiency 
is common (5% or more of the population, depending 
on the threshold concentration used to define deficiency) 



and is associated with severe and repeated infections in 
infants, Soo thill and Harvey [24] described a group of infants 
with recurrent pyogenic infections > whose serum failed to 
opsonize S&ccharomyces cerevisiae. A similar opsonic defect 
was found in about 5% of an apparently healthy adult control 
population. Super and colleagues [25] linked this common 
opsonization deficiency with low MBL expression. 

The variation in MBL levels can be attributed to three 
structural mutations of the MBL gene which are likely to 
result in defective polymerisation, interacting with several 
polymorphisms in the MBL promoter region which influence 
the level of expression. The structural mutations occur at 
high frequency (generally 15% or greater cumulative allele 
frequency in most populations studied) and are single base 
changes in codons 52, 34 and 57 of exon 1. The changes 
are as follows: Arg-52 to Cys (R.52C, MBL D variant), 
Gly-54 co Asp (G54D, MBL B variant) and Gly-57 to Glu 
(G57E, MEL C variant). The A variant is wild-type MBL. 
The 13 and C variants have disrupted Gly-Xaa l -Xaa^ repeats 
of the collagenous region resulting in an altered capacity 
to form the collagen triple helix. The D variant introduces 
an additional cysteine residue and so may disrupt oligomer 
formation by generation of additional disuJphide bonds. 
Within the promoter region of the MBL gene nucleotide 
substitutions at positions —550, —221 and +4 give rise 
to H/L, Y/X and P/Q variants that influence the MBL 
expression, HY and LY hap lo types arc associated with high, 
while LX haplotypes are associated with low, MBL plasma 
levels. Due to linkage disequilibrium, only seven haplo types 
(HYP A, LYQA, LYPA, LXPA, LYPB, LYQC and HYPD) 
are commonly found. Determination of the genetic variability 
in MBL has been performed principally by M.W. Turner and 
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1 



2 i overview of the complement system 

jhe complement cascade can be activated by three d'strna pathways on recognition of a target. All three pelhwgys lead 
Itimalely to the formation of mukicomponenl serine proteases (C3 convertases) C4b2a and C3bBb, which allows these 
nree routes to converge to the activation of C3. The covalent binding of this prolein to activating surfaces Is responsible ioi 
filiating most of the effects of the system. The role of ficolins is described in [49]. mac, membrane attack complex. 
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and their colleagues, and is described in recent 
.,27]. 



MBi and complement 

MB l. is trie only member of the coil cc tin family of proteins 
to ac-civfir.it the complement system. MBL can ace directly 
as nr. opsonin, by binding to carbohydrates on pathogens, 
then interacting with MBL receptors on phagocytic ceils (for 
furtrer discussion of receptors, see [28]), However, it can 
also trigger the opsonic activity of complement, resulting 
in d ?posii ion of C3b/iC3b on targets, and stimulation of 
phagocytic uptake via the C3 receptors, CR1, CR.3 and CR.4. 

The complement system is a major mediator of innate 
immune defence and contributes to many innate immune 
system functions including inflammation,, opsonization and 
lysis It consists of more than 30 proteins, both soluble 
in scrum kind bound to cell membranes. The complement 
system cah be activated via three pathways (classical, lectin 
or akerna 1 ivc) upon recognition of PAMPs (Figure 2). 

Recognition by the first two of these pathways leads to 
ivjation of a cascade of proteases with cleavage of the 
lem< nt components C2 and O* leading to the formation 
pre cease complex C4b2a, which cleaves C3 into C3a 
C3b, C3b is a central component of the complement 
di c to its many binding interactions [29]. It binds 
lend} to co mplement -activating surfaces where it {and 
-eakdpwn product iC3b) act as opsonins for phagocytes, 



and C3b Can also combine with a C3 convcrtasc to form a C5 
convertase. The C5 convertase initiates the assembly of the 
membrane attack complex (MAC) consisting of complement 
components C5-C9, which forms pores in the lipid bilayers 
of its targcts 1 potentially causing cell lysis [30]. C3a acts as 
an anaphylotoxin mediating inflammatory responses along 
with C4a and C5a, other complement component frag- 
ments released during the complement cascade (Figure 2). 

The pattern- recognition molecule of the classical pathway 
is Clq, found completed with two pro tease proenzymes, Clr 
and Cls in the CI complex (Clqr 7 s 2 ) [31,32]. Clq binds to 
immunoglobulin G or M in immune complexes and, without 
need for immunoglobulin, to a wide range of charge patterns 
on target surfaces [33]. Upon binding of C 1 q to a target, Clr 
auto activates and cleaves Cls to its active form. Cls then goes 
onto cleave C4, releasing C4a and a larger fragment, C4b. C4b 
binds to C2 and Cls cleaves the C2 to release C2b, leaving 
C2a completed with C4b to forma C3 convertase (C4b2a), 

Clq is similar in quaternary structure co MBL, and when 
MBL was first shown to activate complement, it was thought 
to act just like Clq, forming a complex with Clr and Cls [34]. 
This can indeed happen in f uitro 1 but in vivo MBL instead 
interacts preferentially with homologues of Clr arid Cls, 
called MBL-associatetl serine proteases (MASPs) [14,35 7 36]. 

The lectin pathway of complement is initiated through 
the binding of MBL or ficolins to carbohydrate structures 
present on a wide range of micro-organisms including 
bacteria, viruses, fungi and parasites, MBL can also recognize 
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MA5P are composed of an N-terminaf complement'Subcomponenl Cir/Cis-like domain (CUB) domain, followed by 
pidemaf growth lacioHike domain (EOF), a second CUB domain, two complement control protein domains (CCPs) and 
:ermlrpl serine prolease domain [AS], Three MASPs have been described so far, MASP-1, masp-2 and more recently 
1], maSP-3 is an oUerna live-splicing product from the MA5P-1 gene, carrying the same hc?.vy chain bu t a different 
ease domain. MApl? consists of Ihe lirsl two domains of MASP-2 ano a unique C-lerrninal sequence (EQ5L). 

and -3 are synthesized as proenzymes of 91, 78 and 101.5 kOa respectively. These molecular masses are 
from iheir amino-acid sequences and by assuming that all the potential N-glycosylation sites are occupied by a 
te of 3.5 kDa. Potential N- and O-glycasvlatign sites (ivl and 0) are shown. 
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rures on antibodies, including the common 
variant IgG- GO [37] and polymeric IgA 
lins are found in scrum complexed with 
7 and a small 19 kDa protein (MAp29). 
tru ecu rally similar to Clr and Cls with 
mposicion (Figure 3). The mechanisms for 
have yet to be fully determined but upon 
;argct 7 MASP-1 and M ASP-2 arc activated 
ike CI rand Cls), and MASP-2 cleaves C4 
C3 convertase C4b2a [39,40], 
pathway is activated by a mechanism 
m the classical and lectin pathways (for 
It reacts to targets according to surface 
charge and neutral sugar. It also has 
endent (IgG— immune complexes) and 
ent modes of activation. 



iplex of the classical pathway* which 
c rnogeneous and of fixed stoiehiometry 
Complexes with MASPs arc likely to be 



heterogeneous. When MBL-MASP complexes are captured 
on mannan-coaced surfaces, MASP-1 is more abundant than 
MASP-2, and they arc not present in a fixed ratio [42,43]. 
The additional components MASP-3 (an alternatively spliced 
produce from the MASP-1 gene [14]) and MApl9 [45] are 
of low abundance. These MBL-MASP complexes are able to 
cleave C2 and C4 to form the C3 converxaae, C4bC2a. This 
activity has been demonstrated to be a property of MASP-2 
but not of MASP-1 [39], MASP-1 and MASP-2 appear to 
function separately and there is evidence to suggest that they 
bind to MBL independently, and activate independently of 
each other [12,42,44].. 

MASP-1 and MASP-2 arc synthesized as proenzymes of 91 
and 78 kDa respectively [35 } 36] (Figure 3). They are activated 
through cleavage of a conserved Arg— lie peptide bond in 
a linker region connecting the serine protease domain to 
the rest of the enzyme. This cleavage results in an A- chain (the 
first live domains and the linker region) linked by a disulphide 
bond to a B-chain, the serine protease domain. The MASPs 
have a try ps in-like (basic specificity) active sice consisting 
of the catalytic residues His, Asp and Set with Asp as 
the substrate- specificity residue at — (> from the active- site 
Scr 
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the proteases Cls and Clr that form heteroiet- 
wi ih the classical pachway component Clq (Clr 2 s ; ) 
MAS M and MASP-2 do nor appear to function as one 
Dahl et al. [14] indicated that different MBL oligomers 
prC5( nt in scrum with differing MASP compositions, 
fou* d MASP-1 and MApl9 associated with smaller 
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men and MASP-2 and MASP-3 with larger oiigo- 
significance of this finding with respect to their 
functions has yet to be determined. As noted 
cleaves and activates C4 and C2 [39,40], and 
like Cls. However there is no definitive natural 
rate identified forMASP-1 orMASP-3 [39,40]. MASP^l 
likef Clr, as it does nor seem to influence MASP-2 
while Clr activates Cls. MA5P-1 was initially 
jjred i|r> cleave C3 directly [46,47] but this now seems 
tudics by "Wong et al. [42], Ambrus et al, [48] 
Haje|a et ah [40] have shown that cleavage of the 
ally inactive form of C3 [C3(H 2 0)] is easily 
and occurs more than 100-fold faster than that 
/tically active C3 ('live' C3) k The observation of 
live C3 is probably due to spontaneous hydrolysis 
xo form C3(H 2 0)j which is then cleaved [40]. 
|>f C3(H;>0) is unlikely to have biological signi- 
C3(H 2 0) has very low abundance in the blood, 
man MASP-l has a thrombin-Iikc substrate spec- 
cloving the coagulation proteins Factor XIII (plasma 
gluta ninasc) and fibrinogen [40]. These reactions arc 
fasdpr than observed for C3 and may indicate that 
and Factor XIII are significant substrates in vivo. 
che substrates of MASP-l and MASP- 3 have yet to 

lored. 
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